A Vector Space Model for Syntactic Distances Between Dialects
Abstract
Syntactic comparison across languages is essential in linguistics, for example when investigating the relationships among closely related languages. In information retrieval (IR) and natural language processing (NLP), syntactic information is used to interpret word occurrences according to the context in which they appear. In this paper, we discuss a mathematical framework for computing the distance between languages from the data available in current state-of-the-art linguistic databases. The framework is inspired by approaches from IR and NLP.
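The abstract does not say how languages are represented or which distance is used, so the following is only a minimal sketch of the general vector space idea. It assumes each dialect is encoded as a binary vector of syntactic features and compares vectors by cosine distance, a measure common in IR. The dialect names, the feature values, and the `cosine_distance` helper are hypothetical illustrations, not the paper's framework.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length numeric vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / (norm_u * norm_v)


# Hypothetical data: each position marks whether a syntactic feature
# is attested (1) or not (0) in the dialect. Not taken from any database.
features = {
    "dialect_a": [1, 0, 1, 1],
    "dialect_b": [1, 0, 1, 0],
    "dialect_c": [0, 1, 0, 0],
}

# Pairwise distances between all dialects in the toy data set.
names = sorted(features)
distances = {
    (x, y): cosine_distance(features[x], features[y])
    for i, x in enumerate(names)
    for y in names[i + 1:]
}

for pair, dist in distances.items():
    print(pair, round(dist, 3))
```

In this toy data, dialects that share more features end up closer together. A real study would have to decide how to treat features that are missing or not applicable in the database, which this sketch ignores.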
Similar resources
Syntactic structure and geographical dialects in the songs of male rock hyraxes.
Few mammalian species produce vocalizations that are as richly structured as bird songs, and this greatly restricts the capacity for information transfer. Syntactically complex mammalian vocalizations have been previously studied only in primates, cetaceans and bats. We provide evidence of complex syntactic vocalizations in a small social mammal: the rock hyrax (Procavia capensis: Hyracoidea). ...
Globalization, Standardization, and Dialect Leveling in Iran
This paper is an attempt to shed light on the effects of modernization, urbanization, monolingual educational system, and mass media as well as the process of globalization on dialect leveling among Persian dialects. In so doing, the first part of the paper elaborates on the relationship between globalization and sociolinguistics, and on the concept of standardization. Also, it discusses some ...
Identification of negated regulation events in the literature: exploring the feature space
Background. Regulation events are of critical importance to researchers trying to understand processes in living beings. These events are naturally complex and can involve both individual molecular entities and other biomedical events. Of equal importance is the ability to capture statements that refer to regulation events that do not take place. In this paper we explore the identification of n...
Perceptive evaluation of Levenshtein dialect distance measurements using Norwegian dialect data
The Levenshtein dialect distance method has proven to be a successful method for measuring phonetic distances between Dutch dialects. The aim of the present investigation is to validate the Levenshtein dialect distance with perceptual data from a language area other than the Dutch, namely Norway. We calculate the correlation between the Levenshtein distances and the distances between 15 Norwegi...
Measuring Syntactic Distances between Dialects: A Web Application for Annotating Dialectal Data
• 15:00-16:30 Session Chair: Maurizio Messina, Biblioteca Nazionale Marciana, Venezia • 15:00-16:00 Invited Talk: Sapienza Digital Library Tiziana Catarci, Marco Schaerf Dipartimento di Ingegneria Informatica Automatica e Gestionale “Antonio Ruberti”, Sapienza Università di Roma • 16:00-16:30 Invited Presentation: Digital Cultural Heritage Projects Opportunities and Future Challenges Rossella C...
Publication date: 2014